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Abstract 

We investigate the worst-case behavior of the simplex algorithm on linear pro- 
grams with three variables, that is, on 3-dimensional simple polytopes. Among 
the pivot rules that we consider, the "random edge" rule yields the best asymp- 
totic behavior as well as the most complicated analysis. All other rules turn out to 
be much easier to study, but also produce worse results: Most of them show essen- 
tially worst-possible behavior; this includes both Kalai's "random-facet" rule, which 
without dimension restriction is known to be sub exponential, as well as Zadeh's de- 
terministic history-dependent rule, for which no non-polynomial instances in general 
dimensions have been found so far. 



1 Introduction 

The simplex algorithm is a fascinating method for at least three reasons: For computa- 
tional purposes it is still the most efficient general tool for solving linear programs, from 
a complexity point of view it is the most promising candidate for a strongly polynomial 
time linear programming algorithm, and last but not least, geometers are pleased by its 
inherent use of the structure of convex polytopes. 

The essence of the method can be described geometrically: Given a convex polytope P 
by means of inequalities, a linear functional ip "in general position," and some vertex Wstart, 
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the simplex algorithm chooses an edge to a neighboring vertex along which decreases 
strictly. Iterating this yields a (/^-monotone edge-path. Such a path can never get stuck, 
and will end at the unique <y9-minimal ("optimal") vertex of P. 

Besides implementational challenges, a crucial question with respect to efficiency asks 
for a suitable pivot rule that prescribes how to proceed with the monotone path at any 
vertex. Since Dantzig invented the simplex algorithm in the late 1940's j3], a great variety 
of pivot rules have been proposed. Most of them (including Dantzig's original "largest 
coefficient rule" ) have subsequently been shown to lead to exponentially long paths in the 
worst case. (See pQ for a survey.) Prominent exceptions are Zadeh's history-dependent 
"least entered" rule, and several randomized pivot rules. Particularly remarkable is the 
"random facet" rule proposed by Kalai [H]; its expected path length for all instances is 
bounded subexponentially in the number of facets. See also Matousek et al. |14j . 

In this paper, we analyze the worst-case behavior of the simplex method on 3-dimen- 
sional simple polytopes for some well-known pivot rules. At first glance, the 3-dimensional 
case may seem trivial, since by Euler's formula a 3-polytope with n facets has at most 
2?7, — 4 vertices (with equality if and only if the polytope is simple), and there are examples 
where n — 3 steps are needed for any monotone path to the optimum (see, e.g.. Figure Q). 
Therefore, for any pivot rule the simplex algorithm is linear, with at least n — 3 and at 
most 2n — 5 steps in the worst case. However, no pivot rule is known that would work 
with at most n — 3 steps. 

In order to summarize our results, we define the following measure of quality. Fix a 
pivot rule TZ. For every 3-dimensional polytope P C and for every linear functional 
ip : — > M in general position with respect to P (i.e., no two vertices of P have 
the same value), denote by A7^(P, f start) the path length (expected path length, if TZ is 
randomized) produced by the simplex algorithm with the pivot rule TZ, when started at 
vertex Wstart- The linearity coefficient of TZ is 

A{TZ) := lim sup ( '^^^"^'!^f'""''^ : P, v?, Wstart as above), 

n{P)^oo ^ n[P) ) 

where n{P) is the number of facets of P. With the usual simplifications for a geometric 
analysis (cf. ^2]; 1201 Lect. 3], jT]), we may restrict our attention to simple 3-dimensional 
polytopes P (where each vertex is contained in precisely 3 facets). So we only consider 
3-dimensional polytopes P, with n = n{P) facets, 3n — 6 edges, and 2r2 — 4 vertices. 
By the discussion above, the linearity coefficient satisfies 1 < A(7^) < 2 for every pivot 
rule TZ. 

The most remarkable aspect of the picture that we obtain, in Section IHl is that the 
"random edge" rule ("RE" for short) performs quite well (as it is conjectured for general 
dimensions), but it is quite tedious to analyze (as it has already been observed for general 
dimensions). The following bounds for the random edge rule 

1.3473 < A(RE) < 1.4943 

are our main results. Thus we manage to separate A (RE) from the rather easily achieved 
lower bound of |, as well as from the already non-trivial upper bound of |. 

On the other hand, in Section 0] we prove that the linearity coefficient for the "greatest 
decrease" pivot rule is A(GD) = |, while many other well-known rules have linearity 
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coefficient A = 2, including the largest coefficient, least index, steepest decrease, and 
the shadow vertex rules, as well as Zadeh's history-dependent least entered rule (not 
known to be super-polynomial in general), and Kalai's random facet rule (known to be 
sub-exponential in general). 

2 Basics 

Klee ^21 proved in 1965 that the "monotone Hirsch conjecture" is true for 3-dimensional 
polytopes, that is, whenever the graph of a 3-dimensional polytope P with n facets is 
oriented by means of a linear functional in general position there is a monotone path of 
length at most n — 3 from any vertex to the sink Wmin- (See Klee & Kleinschmidt ^21 for a 
survey of the Hirsch conjecture and its ramifications.) Unfortunately, Klee's proof is not 
based on a pivot rule. 

Theorem 2.1 (Klee |12] ). For any simple 3-polytope P C M'^, a linear functional if : 
M'^ — > M in general position for P, and any vertex Vstart of P, there is a if -monotone 
path from Vstart to the ip-minimal vertex v^^^ of P that does not revisit any facet. 

In particular, there is a ip-monotone path from v start to fmin of length at most n — 3. 

It is not too hard to come up with examples showing that the bound provided by 
Theorem 12. II is best possible. One of the constructions will be important for our treatment 
later on, so we describe it below in Figure ^ 

A particularly useful tool for constructing LP-oriented 3-polytopes is the following 
result due to Mihalisin and Klee. It is stated in a slightly weaker version in their paper, 
but their proof actually shows the following. 

Theorem 2.2 (Mihalisin &i Klee jTTj). Let G = {V, E) be a planar 3- connected graph, 
f : V — > M any infective function, and denote by G the acyclic oriented graph obtained 
from G by directing each edge to its endnode with the smaller f -value. Then the following 
are equivalent: 

1. There exist a polytope P CM.^ and a linear functional ip : — > R in general position 
for P, such that G is isomorphic to the graph of P and, for every v G V , f{v) agrees 
with the If -value of the vertex of P corresponding to v. 

2. Both (a) and (b) hold: 

(a) G has a unique sink in every facet (induced non-separating cycle) of G, and 

(b) there are three node-disjoint monotone paths joining the (unique) source to the 
(unique) sink ofG. 

Here the fact that the source and the sink of G are unique (referred to in condition (b)) 
follows from (a); cf. Joswig et al. [S]. Equipped with Theorem 12.21 one readily verifies 
that the family of directed graphs indicated in Figure (n > 4) can be realized as convex 
3-polytopes, with associated linear functionals, demonstrating that Klee's bound of n — 3 
on the length of a shortest monotone path cannot be improved. 
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Figure 1: A worst case example for Klee's theorem, starting at Vns (and for Bland's 
rule, starting at V2n-6] see Section l^?T|) . All edges are oriented from left to right. 



At any non-optimal vertex, the random edge pivot rule takes a step to one of its improving 
neighbors, chosen uniformly at random. Thus the expected number E{v) of steps that 
the random edge rule would take from a given vertex v to the optimal one f min may be 
computed recursively as 



where S"^\v) denotes the set of edges that leave v (that is, lead to better vertices), so that 
I 5°^\v)\ is the number of neighbors of v whose yp-value is smaller than that of v. 

Despite its simplicity and its (deceptively) simple recursion, this rule has by now re- 
sisted several attempts to analyze its worst-case behavior, with a few exceptions for special 
cases, namely linear assignment problems (Tovey jTHI), the Klee-Minty cubes (Kelly [TT] . 
Gartner et al. [0]), and (i-dimensional linear programs with at most d + 2 inequalities 
(Gartner et al. [7j). All known results leave open the possibility that the expected num- 
ber of steps taken by the random edge rule on a (i-dimensional linear program with n 
inequalities could be bounded by a polynomial, perhaps even by O(n^) or 0{dn), where 
n is the number of facets. 

However, Matousek and Szabo [1^] recently showed that the random edge rule does 
not have a polynomially bounded running time on the larger class of acyclic unique sink 
orientations (AUSO^s), i.e., acyclic orientations of the graph of a polytope that induce 
unique sinks in all non-empty faces (cf. condition 2(a) in Theorem 12. 2j) . They exhibited 
particular AUSO's on d-dimensional cubes for which random edge needs at least const • 

2const.dl/3 

3.1 Lower Bounds 

The lower bound calculations appear to be much simpler if we do not use the recursion 
given above, but instead use a "flow model." For this, fix a starting vertex fstart, and 
denote by p{v) the probability that the vertex v will be visited by a random edge path 
from fstart to Wmin, and similarly by p{e) the probability that a directed edge e will be 
traversed. Then the probability that a vertex v is visited is the sum of the probabilities 



3 The Random Edge Rule 




(1) 
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Figure 2: Lower bound construction for the random edge rule: The backbone polytope. 
All edges are oriented from left to right. 



that the edges leading into v are traversed, 

if V is not the starting vertex. (Here denotes the set of edges that enter v.) Fur- 

thermore, by definition of the random edge rule we have 

Pie) = ^^P(^) foralleG5°» (2) 

at each non-optimal vertex. The random edge rule thus induces a flow (^p{e))^^^ of value 1 
from t'start to fjnin- The expected path length E(i;start) is then given by 

E(i;start) = $^p(e), (3) 
and we refer to it as the cost of the flow (^(e))^^^. 

Theorem 3.1. The linearity coefficient of the random edge rule satisfies 

A(RE) > Wfs > 1-3473. 

Proof. We describe a family of LPs which show the above lower bound on the linearity co- 
efficient. We start with the graph of the dual-cyclic polytope Csi^k)^ with the orientation 
depicted in Figure El and refer to this as the backbone of the construction. 

Starting at the vertex V2k-7, the simplex algorithm will take the path along the k — 2 
vertices V2k-7,V2k-9, ■ ■ ■ ,V3,vi,V(). Replacing each vertex in the path by a copy of the 
digraph depicted in Figure El — called a configuration in the following — yields the 
desired LP. The corresponding feasible polytope can be constructed explicitly by applying 
10 suitable successive vertex cuts at each vertex Vi of the backbone. Alternatively, one 
can check that the orientations we get satisfy the conditions of Theorem 12.21 

The maximal and minimal vertex of each configuration are visited with probability 1. 
We send 128 units of fiow (each of value through each configuration according to 
see Figure 01 This yields the fiow-cost of -j^ for each of the k — 2 configurations. (The 
last configuration produces fiow-cost of — 1 only, as it does not have a leaving edge.) 

We take the maximal vertex of the configuration at V2k-7 as the starting vertex f start- 
Using equation Q we obtain for the expected cost E(t>start): 

E(tWt) = (A:-2)^-l. 
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Figure 3: Lower bound construction for the random edge rule: The configuration. All 
edges are oriented from left to right. The target of the rightmost edge (not shown) is the 
starting node of the next configuration. The middle dotted edge enters the configuration 
from the corresponding vertex of the top backbone row. The actual fiow at each edge is 
1/128 times the number written next to the edge. 



With n = k + 10{k - 2) this yields 

T?(i, \ — "D—l 1897 _ 1 _ 1897 _ 5202 
I^yVst'Avt) — 11 ■ 128 ^ ~ 1408'^ 1408' 

which proves the lower bound. □ 

The configuration depicted in Figure |H1 was found by complete enumeration of the 
acyclic orientations satisfying condition (a) of Theorem 1221 (AUSOs) on 3-polytopes with 
n <12 facets. In particular, our proof of Theorem 13.11 includes a worst-possible example 
for n = 12. We refer to Mechtel ^H] for more details of the search procedure, as well as 
for a detailed analysis of properties of worst-case examples for the random edge rule. 



3.2 Upper Bounds 

Theorem 3.2. The linearity coefficient of the random edge rule satisfies 

1 30 

A(RE) < < 1.4943. 

Proof. Consider any linear program on a simple 3-polytope with n facets, with a linear 
objective function 99 in general position. We will refer to the (/9-value of a vertex as its 
"height." A 1-vertex will denote a vertex with exactly one neighbor that is lower with 
respect to if. Similarly, a 2-vertex has exactly 2 lower neighbors. Consequently, from 
any 1-vertex the random edge rule proceeds deterministically to the unique improving 
neighbor, and from any 2-vertex it proceeds to one of the two improving neighbors, each 
with probability |. 

Basic counting yields that our LP has exactly {n — 3) 1-vertices and {n — 3) 2-vertices 
in addition to the unique maximal vertex Wmax and the unique minimal vertex Wmm; which 
have 3 and lower neighbors, respectively. For the following, we also assume that the 
vertices are sorted and labelled V2n-5, ■ ■ ■ ,Vi,Vq in decreasing order of their objective 
function values, with Vma.x = V2n-5 and Wmin = ^o- 

For any vertex v, let Ni{v) (resp., N2{v)) denote the number of 1-vertices (resp., 2- 
vertices) that are not higher than v (including v itself). Put N{v) = Ni{v) + N2{v). For 
all vertices v other than the maximal one this is the number of vertices lower than v, that 
is, N{vi) = i for all i ^ 2n — 5. 
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We will establish the following generic inequality: 

E{v) < aNi{v) + (3N{v) (4) 

Here, a and /3 are constants whose values will be fixed later. 

The proof of (j3)) will proceed by induction on N{v). The inductive step will be 
subdivided into 24 distinct cases. Each case depends on a linear inequality on a and /3 
that, when satisfied, justifies the induction step in that case. Since our case analysis is 
complete, we have a proof of (jU for any pair that satisfies all the 24 inequalities. 

Because we always have Ni{v), N2{v) < n — 3, we obtain 

E{v) < aNi{v) + (3{Ni{v) + N2{v)) = {a + p)Ni{v)+(3N2{v) < (a + 2/3)(n-3) 

for V 7^ I'max- The single vertex fmax is irrelevant for the asymptotic considerations. Thus 
we minimize a + 2/5 subject to the linear constraints posed by the various cases; this leads 
to an LP in two variables with 24 constraints, whose optimal solution is — (||, ||), 

of value ^ < 1.4943. This yields the upper bound on A(RE) stated in the theorem. 

We will now prove (jU by induction on N{v). The base case N{v) = is obvious, since 
V is the optimum in this case, and E(v) = 0. Suppose now that (H)) holds for all vertices 
lower than some vertex v. 

By an appropriate unwinding of the recursion we express E(f) in terms of the 
expected cost E{wi) of certain vertices Wi that are reachable from v via a few downward 
edges. The general form of such a recursive expression will be 

k 

E{v) = c + ^A,E(«;i), 

i=l 

where Aj > for z = 1, . . . , /c, and Aj = 1. 

Since we assume by induction that E(wj) < aNi{wi) + PN{wi), for each i, it suffices 
to show that 

k k 

^«A,(iVi(t;)-iViK)) + J]/?A,(iV(t;)-iV(mO) > c. 

i=l 1=1 

Write 

A,{w,) := N,{v) - iViK), A{w,) := N{v) - N{w,), 

for i = 1, . . . ,k. (These terms are defined with respect to the vertex v that is currently 
considered.) Here A[wi) is the distance between v and Wi, that is, one plus the number of 
vertices between v and Wi in the numbering of the vertices (f 2n-5, ■ ■ ■ ,vo) detailed above. 
Clearly A{wi) > Ai{wi) . 

We thus need to show that for each vertex v, 

k k 

a J]A,Ai(«;,)+/5^AiA(M;,) >c. (5) 

1=1 i=l 

At this point we start our case analysis. 
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Case 1: v is a. 1-vertex. 

Let Wi denote the target of the unique downward edge emanating from v as in the following 
figure, where (here and in all subsequent figures) each edge is labelled by the probability 
of reaching it from v. 

V I Wi 



In this case, E{v) = 1 + Fi{wi). In the setup presented above, we have Ai = 1, c = 1, 
^i(^i) > 1, and A{wi) > 1, thus (0) is implied by 

a + p>l. (6) 

Case 2: v is a. 2- vertex. 

Let Wi and W2 denote the targets of the two downward edges emanating from v, where 
W2 is lower than wi. 




We have 

Eiv) = l + ^EK) + ^E(u;2), 
hence we need to require that 

2^iK)+ 2^iM + |AK) + |AM > 1. 

Note that A{w2) > A{wi) > 1. 

Case 2. a: A{w2) > 4 (as in the preceding figure). 
Ignoring the effect of the Ai(wj)'s, it suffices to require that 

^A(^0 + ^A{W2) > 1, 

which will follow if 

13 > I (7) 

Case 2.b.i: A{w2) = 3 and one of the two vertices above W2 and below f is a 1-vertex. 
In this case Ai{w2) > 1 and A{wi) + A{w2) > 4, so (jSJ is implied by 

^a + 2(3>l. (8) 
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Case 2.b.ii: A{w2) = 3 and the two vertices between v and W2 are 2-vertices. Denote 
the second intermediate vertex as v'. We may assume that v' is reachable from v (that is, 
from wi), otherwise we can ignore it and reduce the situation to Case 2.c treated below 
(by choosing another ordering of the vertices producing the same oriented graph). Three 
subcases can arise. 

First, assume that none of the three edges that emanate from wi and v' further down 
reaches W2- Denote by x,y the two downward neighbors of v' and by z the downward 
neighbor of Wi other than v'. The vertices x, y, z need not be distinct (except that x ^ y\ 
but none of them coincides with W2- 

1/2 




We have here c = 7/4. 

To make the analysis simpler to follow visually, we present it in a table. Each row 
denotes one of the target vertices W2,x,y,z, 'multiplied' by the probability of reaching 
it from V. The left (resp., right) column denotes a lower bound on the corresponding 
quantities Ai(-) (resp., A(-)). To obtain an inequality that implies (2), one has to multiply 
each entry in the left (resp., right) column by the row probability times a (resp., times 
(3), and require that the sum of all these terms be > c. 





aAi 


/9A 


l/2W2 





3 


l/8x 





4 


1/82/ 





5 


l/Az 





4 



Note the following: (a) We do not assume that the rows represent distinct vertices (in 
fact, X = z is implicit in the table); this does not cause any problem in applying the rule 
for deriving an inequality from the table, (b) We have to squeeze the vertices so as to 
make the resulting inequality as sharp (and difficult to satisfy) as possible; thus we made 
one of X, y the farthest vertex, because making z the farthest vertex would have made the 
inequality easier to satisfy. 
We thus obtain 

'3 4 5 4\ ^ 7 



or 



(9) 



Next, assume that W2 is connected to v'. In this case W2 is a 1-vertex, and we extend 
the configuration to include its unique downward neighbor w^. 
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Let X denote the other downward neighbor of v' and let y denote the other downward 
neighbor of Wi. In the following table, the 'worst' case is to make ^3 and y coincide, and 
make x the farthest vertex. 



We then obtain 



or 



a 







PA 




1 


4 


l/8x 


1 


5 


1/Ay 


1 


4 


/20 


5 4^ 






8^4; 



19 



33^ 19 
a + —(3 > —. 



(10) 



Finally, assume that W2 is connected to wi. Here too W2 is a 1-vertex, and we extend 
the configuration to include its unique downward neighbor W3. 

1/2 




Denoting by x, y the two downward neighbors of v', our table and resulting inequality 
become 





aAi 


PA 




1 


4 


l/8x 


1 


4 


l/8y 


1 


5 



33^ ^ 5 



(11) 



which, by the way, is stronger than (jlOj) . 



Case 2.c: A{w2) = 2. Hence, the only remaining case is that Wi and W2 are the two 
vertices immediately following v. 
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Case 2.c.i: wi is a 1-vertex (whose other upward neighbor hes above v). Its unique 
downward edge ends at some vertex which is either W2 or hes below W2- 

Assume first that this vertex coincides with W2, which makes W2 a 1-vertex, whose 
unique downward neighbor is denoted as v'. The local structure, table, and inequality are 






aAi 


PA 


v' 


2 


3 



2a + 3f3> -. 



Suppose next that the downward neighbor of wi lies below W2- We get 
1/2 



5^ 3 




c = 3/2 





aAi 


/9A 


1/2W2 


1 


2 


l/2ws 


1 


3 



(12) 



(13) 



Case 2.c.ii: wi is a 2-vertex, both of whose downward neighbors lie strictly below W2- 
Denote these neighbors as w^, W/i, with lying above 1^4. 




We may assume that A{ws) = 3 (i.e., there is no vertex between W2 and W3), since 
the case A{ws) > 4 is already covered by ©. 



Case 2.c.ii.l: W2 is a 1-vertex. Then the table and inequality become 





aAi 


PA 


1/2m;2 





2 




1 


3 


1/4:W4 


1 


4 



1 11^ 3 



(14) 



Case 2.c.ii.2: W2 is a 2-vertex but W3 is a 1-vertex. Then W3 (which satisfies A{w3) = 3) 
is connected either to W2 or to a vertex above v. In the former case, let x denote the 
other downward neighbor of W2, and let y denote the unique downward neighbor of w^. 
The local structure looks like this (with x,y,W4 not necessarily distinct, but they all are 
below W3 due to A^ws) = 3): 
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c = 5/2 

The (worst) table and inequality are 






aAi 


PA 


l/4x 


1 


4 


1/21/ 


1 


4 


l/4u;4 


1 


5 



17^ ^ 5 
a H /3 > -. 



(15) 



The next case is where the other upward neighbor of lies above v. Let x, y denote 
the two downward neighbors of W2, and let z denote the unique downward neighbor of 
W3. (Again, x,y, z,W4 need not be distinct, but x ^ y and they all are below W3 due to 
A{wz) = 3.) The local structure is: 



c = 9/4 

The (worst) table and inequality become 






aAi 


PA 


l/4a; 


1 


4 


1/% 


1 


5 


1/42 


1 


4 




1 


5 



9^ ^ 9 



(16) 



Case 2.c.ii.3: Both W2 and are 2-vertices. We have to consider the following type 
of configuration (where x, y, z, t, W4 need not all be distinct, but x ^ y and z ^ t, and we 
may assume x t, y ^ z; also, because A^ws) = 3, both x and y are lower than W3): 




Intuitively, a worst table is obtained by 'squeezing' x, y, z, t, and W4 as much to the left as 
possible, placing two of them at distance 4 from v, two at distance 5, and one at distance 
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6. However, squeezing them this way will make some pairs of them coincide and form 
1-vertices, which will affect the resulting tables and inequalities. 

Suppose first that among the three 'heavier' targets x, y, w^, at most one lies at distance 
4 from V. The worst table and the associated inequality are (recall that x ^ y): 









l/4x 





4 


l/4y 





5 


l/8z 





4 


l/8t 





6 


1/4^4 





5 



19 9 

—/?>-. 
4^-4 



(17) 



Suppose then that among {w^, x, y}, two are at distance 4 from v, say and y. Then 
W4 = y is a 1-vertex, and we denote by w its unique downward neighbor. The local 
structure is: 




c=ll/4 1/4 
Two equally worst tables, and the resulting common inequality are 





aAi 


PA 










l/4x 


1 


5 




l/4x 


1 


6 


l/8z 


1 


6 




l/8z 


1 


6 


l/8t 


1 


7 




l/8t 


1 


5 


l/2w 


1 


5 




l/2w 


1 


5 



43^ 11 
« + -/?>-. (18) 



Case 2.c.iii: wi is a 2-vertex that reaches W2'i that is, one of its downward neighbors, 
say W3, coincides with W2- Then W2 is a 1-vertex, and we denote by x its unique downward 
neighbor. 

1/2 




A crucial observation is that x cannot be equal to w^. Indeed, if they were equal, then 
would be a 1-vertex. 
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In this case, cutting the edge graph G of P at the downward edge emanating from W4 
and at the edge entering v would have disconnected G, contradicting the fact that G is 
3-connected. 

We first dispose of the case where x lies lower than W4. The table and inequality are 



« + -/?>-. (19) 



In what follows we thus assume that x lies above W4. 







(3A 


3/4x 


1 


4 


1/4^4 


1 


3 



Case 2.c.iii.l: x is a 1-vertex that precedes w^. Suppose first that is the unique 
downward neighbor of x. Then 104^ is a 1-vertex, and we denote its unique downward 
neighbor by z. The local structure, table and inequality are: 




c = 4 





aAi 


PA 


z 


3 


5 



3a + 5/3 > 4. (20) 



Suppose next that the unique downward neighbor ?/ of x is not W4. The local structure, 
table and inequality look like this {y is drawn above W4 because this yields a sharper 
inequality) : 




1/2 ^-^4"' 3/4 —3/4 



c = 3 







PA 




2 


4 


1/4W4 


2 


5 



2a + ^P>3. (21) 



Case 2.c.iii.2: x is a 2-vertex that precedes 1^4. This subcase splits into several subcases, 
where we assume, respectively, that A{w4^) > 6, A{w4) = 4, and A{w4) = 5. 



Case 2.c.iii.2(a). Suppose first that A{w4) > 6. The configuration looks like this: 

1/2 




14 



The table and inequality are 







PA 


3/4a; 


1 


3 




1 


6 



15^ 9 
4 4 



(22) 



Note that this is the same inequality as (fT^ . 



Case 2.c.iii.2(b). Suppose next that A{w4) = 4, and that one of the downward neigh- 
bors of X is W4. Let z denote the other downward neighbor. W4 is a 1- vertex, and we 
denote by w its unique downward neighbor. 

1/2 

V _/ ""J. ^^^^2 ^^'""^^ — 5/8 




c = 29/8 

The 3-connectivity of the edge graph of P implies, as above, that w ^ z. Since we assume 
that A{w4) = 4, z also lies below W4, and the table and inequality are 





aAi 


PA 


5/8w 


2 


5 


3/8z 


2 


6 



43 ^ 29 
2a + —p > —. 



(23) 



Suppose next that A(w4) = 4 and W4 is not a downward neighbor of x. Denote those 
two neighbors as w and z, both of which lie lower than W4, by assumption, and are clearly 
distinct. The configuration, table and inequality look like this: 




c = 3 





aAi 


PA 


1/4^4 


1 


4 


3/8w 


1 


5 


3/8z 


1 


6 



a + ^P>3. (24) 



Case 2. c. ill. 2(c). It remains to consider the case A{w4) = 5. Let z denote the unique 
vertex lying between x and w^. We may assume that z is connected to x, for otherwise z 
is not reachable from v, and we might as well reduce this case to the case A{w4,) = 4 just 
treated. 

Consider first the subcase where the other downward neighbor of x is W4 itself. Then 
W4 is a 1-vertex, and we denote by w its unique downward neighbor. This subcase splits 
further into two subcases: First, assume that 2 is a 1-vertex, and let y denote its unique 
downward neighbor. Clearly, y must lie below W4 (it may coincide with or precede w). 
The configuration looks like this: 



15 



c = 4 



1/4 



The table and inequality are 





aAi 


/5A 


3/82/ 


3 


6 


5/8m; 


3 


6 



3a + 6/3 > 4. 



(25) 



In the other subcase, 2; is a 2-vertex; we denote its two downward neighbors as y and 
t. The vertices w,y,t all lie below and may appear there in any order (except that 
w ^ t). The configuration looks like this: 




The table and inequality are 









3/16y 


2 


6 


2,/m 


2 


7 


5/8m; 


2 


6 



2a 



99 
16 



/? > 4. 



(26) 



Consider next the subcase where Wi is not a downward neighbor of x. Denote the 
other downward neighbor of x as which lies strictly below w^. This subcase splits 
into three subcases. First, assume that 2; is a 1- vertex, and denote its unique downward 
neighbor as w. The configuration looks like this: 




The table and inequality are 







/5A 


l/4u;4 


2 


5 


3/8y 


2 


6 


3/8m; 


2 


5 



43^ 27 
2a + — /3 > — . 



(27) 



Second, assume that 2; is a 2-vertex, so that none of its two downward neighbors is 
W4. Denote these neighbors as w and t. All three vertices ?/, t, w lie strictly below W4, and 
w ^ t. The configuration looks like this: 



16 




The table and inequality are 







PA 


1/4W4 


1 


5 


3/8y 


1 


6 


3/16w 


1 


6 


3/16t 


1 


7 



95^ 27 

a H /3 > — . 

16^ - 8 



(28) 



Finally, assume that z is a 2- vertex, so that one of its two downward neighbors is W4. 
Denote the other neighbor as w. In this case W4 is a 1-vertex, and we denote its unique 
downward neighbor as t. All three vertices y, t, w lie strictly below W4. The configuration 
looks like this: 




The table and inequality are 







PA 


3/8y 


2 


6 


3/lQw 


2 


7 


7/m 


2 


6 



99^ 61 
2a + —8 > — , 
16^ - 16' 



(29) 



which, by the way, is weaker than 

This completes the case distinction. 



Thus dH) holds for every pair (a,/3) that satis- 



fies In particular, it holds for the pair (a,/3) 

the beginning of 
of random edge. 



II, II), which fas discussed at 



>87' 87^' 



the beginning of the proof) yields the upper bound ^ < 1.4943 on the linearity coefficient 



□ 



Discussion. (1) The analysis has used (twice) the fact that G is a 3-connected graph. 
Without this assumption, the linearity coefficient becomes 13/8: A lower bound con- 
struction can be derived from the figure shown in Case 2.c.iii, and an upper bound can be 
obtained along the same lines of the preceding proof, using a much shorter case analysis. 
It is interesting that the proof did not use at all the planarity of the polytope graph G. 

(2) In an earlier phase of our work, we obtained the upper bound of 3/2 on the 
linearity coefficient, using a similar but considerably shorter case analysis. Unfortunately, 
the lengthier case distinction presented in the proof above is not just a refinement of 
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that shorter one (which is the reason for presenting only the lengthier proof). The proof 
indicates that the problem probably is far from admitting a clean and simple solution - at 
least using this approach. Of course, it would be interesting to find an alternative simpler 
way of attacking the problem. 

(3) The solution {a, (3) = (||, ||) satisfies Q and (pUj) with equality. If we examine 
the configuration corresponding to and expand it further, we can replace Q by better 
inequalities, which result in a (slightly) improved bound on the linearity coefficient, at 
the cost of lengthening further our case analysis. This refinement process can continue for 
a few more steps, as we have verified. We have no idea whether this iterative refinement 
process ever converges to some critical configuration, whose further expansion does not 
improve the bound, and which is then likely to yield a tight bound on the linearity 
coefficient. 

4 Other Pivot Rules 

4.1 Bland's Rule 

For Bland's least index pivot rule |2] the facets (inequalities) are numbered. At every 
non-minimal vertex the rule then dictates to choose the edge that leaves the facet with 
the smallest number. (A special feature of Bland's rule is that it does not admit cychng 
even on degenerate programs/non-simple polytopes, when our geometric description of 
the rule is, however, not applicable.) 

Proposition 4.1. The linearity coefficient of Bland's rule is 2. 

Proof. Figure ^ illustrates a family of 3-dimensional LPs on which Bland's rule, started 
at Wstart = V2n-%, visits all but one of the vertices. (As we have already noted, the directed 
graph in the figure is readily verified to satisfy the conditions of Theorem 12. 21 ) Specifically, 
choose an initial numbering of the facets where the largest index is assigned to facet /. 
When starting at the vertex Wgtart = V2n-Q the simplex algorithm with Bland's rule visits 
the 2n — 5 vertices W2n-6) • • • , I'd- D 

4.2 Dantzig's Rule 

Dantzig's rule is the original rule proposed by Dantzig when he invented the simplex 
algorithm. In his setting of a maximization problem formulated in the language of simplex 
tableaus, the rule requires to pivot into the basis the variable that has the largest reduced 
cost coefficient (if no variable has positive reduced cost, the current tableau is optimal). 

By suitably scaling the inequalities of the LP, Dantzig's rule follows the same path as 
Bland's rule; see Amenta & Ziegler ^ Observation 2.6]. Thus Dantzig's rule cannot be 
faster than Bland's rule, and Proposition 14.11 thus implies: 

Proposition 4.2. The linearity coefficient of Dantzig's rule is 2. 
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4.3 Greatest Decrease Rule 



The greatest decrease rule moves from any non-optimal vertex to the neighbor with the 
smallest objective function value. We assume that the objective function is generic, so 
the vertex is unique. However, the greatest decrease rule may compare non-adjacent 
neighbors, so the information given by the directed graph is not sufficient to implement 
it; we rather need explicit objective function values. 

Proposition 4.3. The linearity coefficient of the greatest decrease rule is |. 

Proof. First we show that A(GD) > |. Figure IH indicates a family of 3-dimensional 
LPs. By Theorem 12.21 there is a realization of these LPs with the objective function 
linear ordering on the vertices given by the left-to-right ordering in our figure. Started at 
^start = V2n-Q, the greatest decrease rule visits all 1- vertices, the global sink, and half of 
the 2- vertices. Thus it needs \{n — ?>) pivot steps to reach v-^ain = Vq. 

1'2n-12 1^271-14 '"2n-20 

V2n-7 V2n-ll V2n-i:i l'2n-19 ""5 ""4 



V2n-6 = ''start d ^ g < 

/ "271-8 "271-9 

/ "271-10 



"271 — 5 — "max 

Figure 4: Lower bound for the greatest decrease rule. All edges are oriented from left to 
right. 




For the proof of A(GD) < |, we consider an arbitrary instance with n, P, ip, and fstart 
as above. Denote by rii and n2 the number of visited 1- and 2-vertices, respectively. 
Thus there are n — 3 — ni and n — 3 — n2 unvisited 1-vertices and 2-vertices, respectively. 
For every visited 2-vertex v only one of the two direct successors v' and v" is visited. 
Assuming that f{v') > (p{v"), the greatest decrease rule will proceed directly from v 
to v" and thus skip t;', whose objective function value satisfies (p{v) > (p{v') > ip{v"). 
Thus there is an unvisited vertex uniquely associated with every visited 2-vertex. Thus 
77-2 < 2n — 6 — ni — 77.2, which is equivalent to rii + 2n2 < 2n — 6. We get 

ni + n2 = irii + + 2ri2) < |(r2 - 3) + |(2r2 - 6) < |(n - 3). 

This yields A(GD) < | and completes the proof. □ 

4.4 Steepest Decrease Rule 

At any non-minimal vertex v the steepest decrease pivot rule moves to the neighbor w 
with vw being the steepest decreasing edge, that is, such that y^'^^yl.ii is minimal (where 
(c, x) is the objective function). 

Proposition 4.4. The linearity coefficient of the steepest decrease rule is 2. 
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Proof. Figure El depicts a planar projection onto the (xi, X2)-plane of an LP that is easily 
constructed either "by hand" or as a deformed product (see Amenta & Ziegler ^J). If 
the polytope is scaled to be very flat in the Xs-direction, then steepest decrease tells 
the simplex algorithm to use the edge that in the projection has the smallest slope (in 
absolute value). Thus starting at Ustart = V2n-5, the steepest decrease rule visits all the 
vertices. □ 



Figure 5: Lower bounds for the steepest decrease and shadow vertex rules. Planar 
projection of the polytope: The objective function is xi] it directs all edges from left 
to right. 



4.5 Shadow Vertex Rule 

The shadow vertex pivot rule chooses a sequence of edges that lie on the boundary of 
the 2-dimensional projection of the polytope given by x i— ((c, x), (rf, x)), where (c, x) 
is the given objective function, and {d, x) is an objective function that is constructed to 
be optimal at the starting vertex Wstart- The vertices that are visited on the path from 
■^start to fmin are then optimal for objective functions that interpolate between {d, x) and 
(c, x). (This pivot rule is known to be polynomial on "random linear programs" in specific 
models; cf. Borgwardt jH], Ziegler |2Ij, and Spielman & Teng [TR].) 

Proposition 4.5. The linearity coefficient of the shadow vertex rule is 2. 

Proof. We reuse the linear programs of Proposition I4.4| /Figure El Here t'2n-5 = f max is 
optimal for the starting objective function (d, x) = X2, while Vq is optimal for (c, x) = Xi. 
On the way from f2n-5 to fmin = Vq the shadow vertex rule visits all the vertices. □ 

4.6 Random Facet 

The random facet pivot rule, due to Kalai [101 p. 228], is as follows: 
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(RF) At any non-optimal vertex v choose one facet / containing v uniformly at random 
and solve the problem restricted to / by applying (RF) recursively. 
The recursion will eventually restrict to a one-dimensional subproblem (that is, an 
edge), which is solved by following the edge. 

The one-dimensional base case singled out here is only implicit in Kalai's work. This is 
probably the reason why there are different versions of this rule in the literature which 
unfortunately were not distinguished. They all differ in the way how 1-vertices are treated. 
Since the (unique) out-edge of a 1-vertex is always taken with probability one (regardless 
of which facets we restrict to) we could use the following alternative formulations of the 
random facet rule: 

(RFl) At each non-optimal vertex v follow the (unique) outgoing edge if v is a 1-vertex. 
Otherwise choose one facet / uniformly at random containing v and solve the prob- 
lem restricted to / by applying (RFl) recursively. 

(RF2) At any non-optimal vertex v choose one facet / containing v uniformly at random 
and solve the problem restricted to / by applying (RF2) recursively. The minimal 
vertex opt(/) of / is a 1-vertex and we follow the (unique) outgoing edge of the 
vertex opt(/). 

The variant (RFl) appears in Gartner, Henk & Ziegler jHl p. 350], while the version (RF2) 
is from Gartner [5], who, however, formulated this variant of the random facet rule for 
combinatorial cubes, where the formulations above are equivalent. 

Note that (RF) uses randomness at every vertex, and (RFl) would follow a path of 
1-vertices deterministically, while (RF2) takes at most one deterministic step in a row. 
This results in distinct pivot rules, with different worst case examples. 

Proposition 4.6. For each version (RF), (RFl), and (RF2) of the random facet rule the 
linearity coefficient is 2. 

Proof. Figure El depicts a family of LPs with 2n — 4 = 2a + 26 + 2 vertices and n = a + b + 3 
facets. For each of the b 1-vertices Wgtart = V2n-7, V2n-9, ■ ■ ■ , 'V2a+i, the probability of leaving 
it via choosing facet / is |. After choosing facet /, (RF) "sticks" to facet / until Va is 
reached. 

Choosing a = and b = k we obtain a family of LPs with n = k"^ + k + 3 facets. 
Then (RF) sticks to facet / with probability p > 1 — (1)*". Thus the expected number of 
visited vertices is at least 

(l-(i)^)(2a + 6) > 2A:^-^. 

Since there are n = k"^ + k + 3 facets, the linearity coefficient is 2. 

The version (RFl) of the random facet rule follows the path of 1-vertices Vstart — 
V2n-7, V2n-9, ■ ■ ■ , V2a+i dcterministically. We can cut off each of these vertices. This yields 
the graphs depicted in Figure At each source of the new facets Ai, . . . , A;,, the facet 
/ is chosen with probability |. If any of the other two facets is chosen, we end up at the 
sink vertex of the respective facet Aj. Thus the linearity coefficient remains 2, only the 
rate of convergence decreases. The same works for (RF2) as well. □ 
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^min 



Figure 7: Lower bounds for the (RFl) and (RF2) variants of the random facet rule, and 
for the least entered rule with random edge as the tie-breaking rule. All edges are oriented 
from left to right. 



4.7 Least Entered Rule 

At any non-optimal vertex, the least entered pivot rule chooses the decreasing edge that 
leaves the facet that has been left least often in the previous moves. In case of ties a 
tie-breaking rule is used to determine the decreasing edge to be taken. Any other pivot 
rule can be used tie-breaking rule. 

The least entered rule was first formulated by Norman Zadeh around 1980 (see ^3] 
and [23). It has still not been determined whether Zadeh's rule is polynomial if the 
dimension is part of the input. Zadeh has offered $1000 for solving this problem. 

Proposition 4.7. The linearity coefficient of the least entered rule with greatest decrease 
as tie-breaking rule is 2. 

Proof. Figure IHl describes a family of 3-dimensional LPs, where the left-to-right ordering 
of the vertices suggested by the figure can be realized, according to the Mihalisin-Klee 
Theorem 12.21 Starting at Vstart = V2n-6, the greatest decrease rule decides to leave the 
facet /. Following two 1-vertices the facet / is entered again. All upcoming facets have 
not been visited before. Thus the least entered rule "sticks" to the facet / and 2n — 7 
vertices (that is, all but 3 vertices) are visited. □ 

Proposition 4.8. The linearity coefficient of the least entered rule with random edge as 
the tie-breaking rule is 2. 
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fn-9 



Figure 8: Lower bound for the least entered rule with greatest decrease as the tie-breaking 
rule. All edges are oriented from left to right. 

Proof. Figure [3 describes LPs with 2n — 4 = 2a + 46 + 2 vertices and n = a + 26 + 3 facets. 
At the sources of the facets Aj the random edge rule leaves the facet / with probability 
|. As soon as / is left once, it will be revisited and the least entered rule will "stick" to 
the facet /. (Thus the only way not to "stick" to / is that the random edge rule chooses 
to continue along / until it reaches the vertex V2a-) When the least entered rule "sticks" 
to the facet / all of the 2a vertices V2a-i, ^20-2, • • • , ^^i, ^0 are visited. 

Now the analysis is exactly the same as in the proof of Proposition 14.61 Thus choosing 
a = k'^ and b = k yields that the linearity coefficient is 2. □ 

Acknowledgements. We are grateful to Emo Welzl and Giinter Rote for inspiring 
discussions and helpful comments. 
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